首页> 外文OA文献 >Sequential category aggregation and partitioning approaches for multi-way contingency tables based on survey and census data
【2h】

Sequential category aggregation and partitioning approaches for multi-way contingency tables based on survey and census data

机译:顺序类别聚合和分区方法   基于调查和人口普查数据的多向列联表

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Large contingency tables arise in many contexts but especially in thecollection of survey and census data by government statistical agencies.Because the vast majority of the variables in this context have a large numberof categories, agencies and users need a systematic way of constructing tableswhich are summaries of such contingency tables. We propose such an approach inthis paper by finding members of a class of restricted log-linear models whichmaximize the likelihood of the data and use this to find a parsimonious meansof representing the table. In contrast with more standard approaches for modelsearch in hierarchical log-linear models (HLLM), our procedure systematicallyreduces the number of categories of the variables. Through a series ofexamples, we illustrate the extent to which it can preserve the interactionstructure found with HLLMs and be used as a data simplification procedure priorto HLL modeling. A feature of the procedure is that it can easily be applied tomany tables with millions of cells, providing a new way of summarizing largedata sets in many disciplines. The focus is on information and descriptionrather than statistical testing. The procedure may treat each variable in thetable in different ways, preserving full detail, treating it as fully nominal,or preserving ordinality.
机译:较大的列联表在许多情况下都出现了,但尤其是在政府统计机构收集调查和普查数据的过程中。由于在这种情况下,绝大多数变量具有大量类别,因此,代理和用户需要系统地构建表的方式,这些表是这样的列联表。我们在本文中通过找到一类限制对数线性模型的成员来提出这种方法,该模型使数据的可能性最大化,并使用该模型来找到表示该表的简约方法。与在分层对数线性模型(HLLM)中进行模型搜索的更多标准方法相比,我们的过程系统地减少了变量类别的数量。通过一系列示例,我们说明了它可以保留与HLLM建立的交互结构的程度,并可以用作HLL建模之前的数据简化程序。该过程的一个特点是,它可以轻松地应用于具有数百万个单元格的许多表,从而提供了一种在许多学科中汇总大数据集的新方法。重点是信息和描述,而不是统计测试。该过程可能以不同的方式在表中处理每个变量,保留完整的细节,将其视为完全标称的,或保留序数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号